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Abstract 

Given the univariate marginals of a real-valued, continuous-time martingale, (respectively, a family 
of measures parameterised by t £ [0, T] which is increasing in convex order, or a double continuum of 
call prices) we construct a family of pure-jump martingales which mimic that martingale (respectively, 
are consistent with the family of measures, or call prices). As an example, we construct a fake Brownian 
motion. Then, under a further ‘dispersion’ assumption, we construct the martingale which (within the 
family of martingales which are consistent with a given set of measures) has the smallest expected total 
variation. We also give a path-wise inequality, which in the mathematical finance context yields a model- 
independent sub-hedge for an exotic security with payoff equal to the total variation along a realisation 
of the price process. 


1 The problem 

In this article we are concerned with the following problem: given a martingale M = ( M t )o<t<T on a 
stochastic basis (Sl M ,X M ,¥ M ,¥ M ), construct a fake version of M (or equivalently a process which mimics 
M), i.e. construct (potentially on a new filtered probability space (SI, T, F, P)) a stochastic process X = 
(X t )o<t<T such that A is a F-martingale and the univariate marginals of X are the same as those of M, but 
such that the joint marginals are different. 

The problem can be reformulated in two further ways. First, instead of beginning with a martingale 
M, we can begin with a family of laws (nt)o<t<T which are increasing in convex order. Then the aim is 
to construct a filtered probability space (SI, T, F, P) and a F-martingale X = (A t )o<t<r on that space such 
that P(X t < x) = Ht((— oo,x]) for all t and all x. Then we say that X is consistent with the measures 
(Ht)o<t<T■ Second, but closely related, instead of beginning with a process or a set of measures we can 
work in the setting of mathematical finance and start with a double continuum of European call prices 
{C(t, k); 0 < t < T, 0 < k < oo} which satisfy no-arbitrage conditions. (We assume that we are working with 
discounted prices, and then the no-arbitrage conditions are that: for each t , C(t,k) is a decreasing convex 
function with C'(t, 0+) > 1 and lim^oo C(t, k) = 0; C(t, 0) is a positive constant, independent of t: and 
C(t, k) is non-decreasing in t , for all k.) Then the aim is to find a model which is consistent with the given 
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call prices, i.e. a filtered probability space and a martingale X on that space such that E[(X t — k) + ] = C(t, k) 
for all t and all k. Then we say X is consistent with the family of call prices { C(t , fc)}. 

For a given martingale (M t )o<t.<T (or family of laws (/ iit)o<t<T or call prices (C(f, fc))o<t<T,o<fc<oo) there 
will in general be many martingales X which mimic M (are consistent with the family of measures or call 
prices). (One rare exception is if /zt is the uniform measure on the two point set {—f, t} for each t.) Then an 
extension of the mimicking martingale problem is to find an extremal martingale with the correct marginals. 
Thus, given a path-dependent functional, for example the total vaiation of the path, the aim is to find 
the martingale which minimises (the expected value of) the path-dependent functional within the class of 
martingales which have the given marginals. A strong motivation for considering problems of this type is 
from mathematical finance where there is a literature on robust (model-independent) pricing and hedging. 
Typically in the modelling of financial security prices we do not know the underlying dynamics, but we can 
observe put and call prices, and from these we can infer the market-implied law of the security price at 
maturity times t. If a double continuum of European vanilla option prices is available, then we may assume 
that the family of laws (nt)o<t<T is given and the problem is to construct processes which are consistent 
with this family. Further, since the mathematical theory of option pricing postulates that security prices are 
given as discounted expectations under an equivalent martingale measure, a natural additional requirement 
is that the price process is a martingale. Then the lowest expected value of the path-dependent functional 
(taken over models consistent with call prices) is identified with the lowest arbitrage-free price for an exotic 
derivative. 

The literature on these problems can be traced back to Gyongy m, Dupire [9], Madan and Yor m, the 
literature on fake Brownian motions (Hamza and Klebaner J0], Albin [2], Oleszkiewicz ESI), Pcocs (Hirsch 
et al pS]) and most recently and most relevantly for this paper, Henri-Labordere et al [T6j and Kallblad et 
al [25) . In addition there are strong connections with the literatures on the Skorokhod embedding problem 
(Sep), see Obloj [28] and Hobson 20] for surveys, martingale optimal transport (Mot, Beiglbock et al [5], 
Beiglbock and Juillet ED, and robust hedging of options (Hobson |TiH [20] ). 

Given an Ito process, Gyongy m showed how to construct a diffusion X with the same univariate 
marginals. Dupire [S] took a finance viewpoint and argued that there is a unique martingale diffusion pro¬ 
cess which is consistent with any sufficiently regular set of discounted call prices. Madan and Yor m looked 
for Markov martingales which are consistent with a given set of univariate marginals and give several con¬ 
structions inlcuding one based on the Azema-Yor solution of the Sep. In the case of Gaussian laws, this 
construction yields a discontinuous fake-Brownian motion. Unaware of this result, Hamza and Klebaner [14j 
introduced the fake Brownian motion problem, and gave a further example of a discontinuous martingale 
with N( 0, t) marginals. Their paper inspired Albin [2] and Oleszkiewicz [29] to produce very elegant con¬ 
structions of continuous non-Markovian martingales with N( 0, t) marginals. More generally, the problem of 
constructing processes consistent with a given set of marginals (which are increasing in convex order) was 
called the PCOC problem by Hirsch et al [T5]- They give several general methods and examples. However 
there is much scope for further solutions especially those with optimality properties. 

From the mathematical finance perspective the study of extremal consistent martingales can be traced 
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back to Hobson nsi> see also the survey article [20] . In this literature a path-dependent functional is 
interpreted as the payoff of an exotic option and then the lowest price which is consistent with a given set 
of call option prices and no-arbitrage is the minimum over all consistent martingale models of the expected 
value of the path-dependent functional. One of the significant achievements of this literature is to understand 
the relationship between this primal pricing problem and a dual hedging problem. In the dual problem the 
aim is to find strategies which satisfy 

Exotic Option Payoff > Payoff from Vanilla Options + Payoff from Self-financing Strategy. 

Then we can represent the lower bound on the price of the exotic option in terms of the most expensive 
sub-hedging vanilla option portfolio (the self-financing part being costless under any martingale model). 
The fact that European call options (which have prices which depend on univariate marginals) are often 
liquidly traded, whereas derivatives with payoffs depending on two-or-more maturities (e.g. forward starting 
straddles or barrier options) are relatively less liquidly traded, provides a strong motivation for considering 
mimicking problems where the aim is to match univariate marginals, but not joint marginals. 

One approach to solving the mimicking problem is to exploit links with the Sep. Given a stochastic 
process Y and a target law ft. the Sep is to construct a stopping time r such that Y T ~ ft. The classic 
setting is when Y (which we now write as W) is a Brownian motion null at 0 and ft is a centred probability 
measure. Then, using the correspondence between right-continuous martingales and minimal time-changes 
of Brownian motion, given r such that W T ~ ft we can define X = {X t )o <t<T — (H 7 4 ,)o<t<T such that 
X has law ft at a fixed time: simply take A t = tJ—t A r. Then we have a martingale with marginal ftr 
at time T. Moreover, given solutions of the Sep for non-trivial initial laws (the forward-starting Sep) we 
can use concatenation to construct processes with prescribed univariate marginals at a finite set of times 
0 < *1 < t 2 < . . . t„ < T. 

Following work of Beiglbock et al [5] the problem of constructing martingales with given laws at two times 
(whether via a connection with the Sep or otherwise) is sometimes called the martingale optimal transport 
problem (Mot). Given a pair (ft, it) of measures which are increasing in convex order, the Mot is to 
construct a coupling or joint law with marginals (/z, v) which respects the martingale property. Mot theory 
has proved extremely useful both in constructing extremal consistent models, and in proving martingale 
inequalities (see e.g. Acciaio et al [1]). Again, from Mot based on pairs of measures, there is a natural 
extension to finite families of measures. 

The problem of constructing a fake version of a continuous-time martingale can be thought of as the 
infinite-marginal generalisation of the multi-marginal Sep or Mot. Indeed, the existence of solutions of 
these problems underpins a natural approach to the mimicking problem: discretize time using a sequence 
of partitions to reduce the continuous-time problem based on (ftt.)o<t<T to a sequence of discrete-time 
problems with marginals (fl^ )o<k<K(n) (where jxf^ 1 = ftkT/K(n))'i construct a sequence of discrete-time 
martingales X^ with marginals (ft^)o<k<K(n)i re-interpret these discrete-time processes as skeletons of 
piecewise constant continuous-time martingales X( n ' ) ; and finally show that the processes X^ converge to 
a consistent martingale. It is this last step which is potentially difficult, especially if we want to show that 
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some optimality property is preserved in the limit. This is the approach taken in both Henri-Labordere 
et al [16] and Kallblad et al [25] based on solutions of the Sep/Mot due to Beiglbock and Juillet [6] and 
Henri-Labordere et al m respectively. 

The approach in this article is different. Rather than using an approximation technique we aim to write 
down the mimicking martingale directly. Conceptually, this is more direct than the approximation procedure 
used in m and |25j . There is one key idea which allows us to do this: our solutions are based on the principle 
that whenever possible we leave the location of mass unchanged. The processes we construct have finitely 
many jumps in every interval [e, T] and are constant between jumps. 

Our main contributions are as follows. Firstly, under fairly weak assumptions we give a family of mar¬ 
tingales whose univariate marginals match a given set of measures. Secondly, under an extra simplifying 
assumption - the extension of the ‘dispersion assumption’ in [22] from pairs of marginals to a continuum of 
marginals - we identify the consistent martingale with smallest expected total variation. (Note that if our 
goal is to mimic a martingale diffusion M then necessarily M has infinite variation, almost surely.) Thirdly, 
we give a pathwise inequality which can be interpreted as a model independent sub-hedge in the exotic 
option pricing setting. 

In particular, we show that there is a martingale with Gausian univariate marginals (mean zero, variance 
t at time t ) which unlike Brownian motion has finite expected total variation. More generally, suppose 
(pt)o<t<T is a family of probability measures which is increasing in convex order, and suppose that /i t has 
a continuous density p(t, x ) and is such that p is differentiable in t (and such that p and p have certain 
integrability properties). Now add the hypothesis that p(t,y) < K(t)p(t,y) for some decreasing function 
K(t): in the Brownian case we can take K(t) = 1/2 1. In this setting we show how to construct a consistent 
martingale from a Poisson point process and a family of martingale couplings. Different martingale couplings 
will lead to different mimicking processes, so this construction potentially yields many consistent martingales. 
Further, since the dispersion assumption is satisfied in the Brownian case, we can construct the fake Brownian 
motion of minimal expected total variation. 

2 Construction of a family of mimicking process 

2.1 Set-up, assumptions and examples 

Fix e,T with 0 < e < T < oo and let T be the open interval T = (e,T) and T = [e,T]. We suppose we are 
given {pt) te f; our aim is to construct a martingale X = (X t ) te f with marginals (A*t) te f ■ 

We begin by stating our assumptions on the family of measures (pt) tG f The first assumption is necessary 
for the family (/it) tg f to be increasing in convex order, and hence by Kellerer’s Theorem [26) for there to 
exist a martingale with this family of marginal distributions. A key quantity is U(t,x ) := / \y — x\pt{dy) 
which is (minus) the potential of pt- Then U is non-decreasing in t for each fixed x if and only if the family 
(pt) tS: f is increasing in convex order. 

Assumption 2.1. (a) For all t G T, f pt{dy) = 1; f \y\p t (dy) < oo; J ypt(dy) = ~p- 
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(b) U(t,x) := J\y — x\nt(dy) is non-decreasing in t £ T for each x. 

Now we impose some further regularity conditions on the potential. 

Assumption 2.2. (a) The potential U(t,x) is continuous in t for t £ T. 

(b) Qt{x ) = Q{t,x ) := \U(t,x ) exists for each t £ T and is continuous in x and t. Also , Qt(x) satisfies 
linLj^-i-oo Qt(x) = 0 and lim x _ ) .-|- 00 Qt(x) — xQ' t (x) = 0. Since U is non-decreasing in t we have Qt > 0. 

(c) For each t £ T, Qt(-) is the difference of two positive, decreasing convex functions. In particular, 

Qt{x ) = A t (a;) — T t (x) where linvpx, A t (x) = 0. Further, A t is such that m{t) := lim^-oo A t (x)/\x\ 

and n(t) := lim^-oo A*(a;) — xA' t (x) are finite. Finally, f teT m(t)dt < oo. 

Since At and are convex we can write At = A" and "ft = T", whence 7 1 and At are measures (which we 
argue below have the same finite total mass m(t)). For uniqueness we insist that At and 71 are orthogonal. 
This is equivalent to choosing the decomposition of Q t = A t — r t such that A t and r t are as small as possible. 
Then qt = Q" may be interpreted as a signed measure and qt (dx) = At (dx) — "ft (dx). 

(d) For each t £ T, "ft is absoluteluy continuous with respect to pit and the Radon-Nykodym derivative 

R t (x) = "/t(dx)/pit.(dx) is measurable in x and t and is bounded in the sense that Rt{x) < K(t) for 
some function K(t) which is bounded on T by K. 

(e) For any measurable set B and for all t £ T, fit(B) = pt e (B) + f* q s (B)ds. 

Fix t £ T. By assumption lim a; _ > .-|- 00 Qt(x) = 0 and since Qt is the difference of convex functions and 
since lim^^-oo A t (x) = 0 we have lim^oo r t (x) = 0 also. The fact that lim^oo A t (a;) — xA' t (x) = 0 follows 
from the convexity of A f as does the existence of lim^-oo A t (a;) — xA' t (x) in [0, 00 ]. Hence the content of the 
assumption on n(t) is that n(t) is finite. 

Convexity of A t implies that lim^-oo A t (a;)/|ir| exists in [0,oo], so again, the content of the assumption 
on m(t) is that this limit is finite. In addition lim^-oo Qt{x) = 0 by hypothesis and we conclude that 
lim^i-oo r t (a;)/|a;| = m(t) also. In particular, "f t and A t have the same total mass, and qt has zero total 
mass. 

Finally, since linx^-oo Qt(x) — xQ' t (x) = 0 and lim^-oo A t (x) — xA' t (x) = n(t), we conclude that 
lim^-oo r t (a;) — ajT^a;) = n(t) also. Hence A t and 7 1 have the same mean. The fact that Qt > 0 im¬ 
plies that the pair ( 7 1 , At) is increasing in convex order. 

Remark 2.1. Our ultimate goal is to construct a process on [0,T], However, it is clear from the examples 
that below that often there is no universal bound K for K(t) on intervals of the form T = (0,T), and the 
processes we build will have infinitely many jumps on such intervals. To circumvent any problems related 
to the presence of infinitely many jumps near t = 0 we first construct processes with time parameter set 
T = [e,T], and then extend to larger intervals such as [0,T] by concatenation. See Section [3] 

Remark 2.2. The requirement that Q exists for all t £ T can be relaxed to an assumption that Q exists 
and has the properties of Parts (b), (c) and (d) of Assumption 12.21 on a subset To C T of full measure. 
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For each t € T, let If be the smallest interval such that p t has support in If. Let the endpoints of If be 
given as {^(t), r M (f)}. Let I = U te flf- Similarly, let if with endpoints r\(t)} denote the smallest 

interval such that A t has support in if, with similar conventions for other measures. Then, since the pair 
(yt, X t ) is increasing in convex order we must have f? M (f) < I\(t) < f? 7 (f) < r 7 (t) < r\(t) < r^(t). 

Definition 2.1. By the regular case we mean that U(t,x) £ C 1,2 (so that the density p{t,x) of pt and 
pit,x) exist and are continuous), that for each t £ T, f \p(t, x)\dx < oo and f \x\\p(t,x)\dx < oo and also 
f T dt J | p{t, x)\dx < oo. 

Remark 2.3. Suppose we are in the regular case. Then T t (a;) = Jf°(y — x)p(t, x)~dx and A t (x) = jf° (y — 
x)p(t,x) + dx. Since / \p(t,x)\dx < oo and / \x\\p[t,x)\dx < oo it follows that m(t) and n[t) are well defined 
and finite. Then 

m(t)= lim lim { f°° Vp{t 'f dy + f°° p(t, y)~dy\ = f°° p(t,y)~ dy 

24-00 \x\ *4.-00 1 J x \x\ J x ) J_ 00 


and n(t ) = yp{t, y) dy. 

It follows from the condition that f \p(t, x)\dx is integrable on T and an application of Fubini’s theorem 
that 0 = p(s, x)dx, and hence that m(t) = f p(t,x)~dx = J p(t,x) + dx. Similarly we find that n(t) = 
f xp(t,x)~ dx = f xp(t, x) + dx. If H is bounded and measurable, we have 



H(x)q u (dx) 


H(x) J p(u,x)dx = jH{x)[p(t, x) — p{to, x)]dx = JH(x)pt[dx) — jH(x)p,t 0 (dx) 


and taking H(x) = I{ x eB} we find fjjp, t (B) = qt(B). Finally, in the regular case, Rt[x) = p(t,x)~ / p(t,x), 
and Assumption I2.2f dl becomes p(t,x)~ < K(t)p(t,x) uniformly in x £ I, for some decreasing function K. 
In particular, in the regular case verifying that the conditions of Assumption 12.21 are satisfied reduces to 
verifying that p(t,x)~ < p(t,x)K(t). 


We give illustrative five examples. The first two examples are natural examples where the situation is 
regular in the sense of Definition 12. II The third example is also completely natural, but is such that although 
Pt and 7 1 have nice densities the measure A* is atomic. The fourth example has a reverse structure in which 
7 1 is purely atomic (and pt has an atom), but At has a density. The final example is beyond the scope of 
our methods: the issue is not that p t has atoms, but rather that the atoms of pt move over time. 

Let S y denote the point mass at y so that 6 y (B) = I{ yGB y Define <h(a;) = e dy. 

Example 2.1 (Brownian motion). Let p t be the law of a centred Gaussian variable with variance t. Then 
the family (pt)t> o is the family of marginals of Brownian motion. We have 


U(t,x) = 2 \/t • 


o-* 2 /2 1 


V2tt Vi WiJ 2 Vi 


( 4 = 1 - 


P (t, x) = -U"(t,x) = -j= 


-x 2 /2t. 
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1 • 1 e -a;2 / 2t 
Q(t,x) = -U(t,x) = 


R t (x) = 


p{t,x) = 
(t — x 2 ) + 




2 1 2 


V-*); 


< 


i 


=: K(t). 

1 e- 1 ' 2 


2 1 2 - 2 t 

For this example m(t) = ^ f\(l ~ 2 / 2 )e _y2 / 2 -^|= = an< ^ n W = 0- 

Example 2.2 (Exponential Brownian motion). Let pt be the time-t law of exponential Brownian motion. 


U(t,x) = 2$ 


— In x yft 

“7T + T 


- 2x$ 


— In a; y/t 
~t 2 " 


= \v"(t, X) = - 3 ^ 


e -(\nx) 2 /2t e -t/S. 


1 • 


Q(t,x) = -U(t,x) = 2V — 


e -{Xnx) 2 !2t e - m = ^_ p{ix) . 


P(t,x) = ^71 p-lO-nx) 2 -t~t 2 /% 


R t (x) = 


1 1 

8 + 2 1 


(In a;) 2 


2 1 2 

< 1/(2*) + 1/8 =: K(t). 


Note that this example is asymmetric about the mean 1 and n(t) 1. 

In general there are few examples of fake exponential Brownian motions — a rare exception is [21]. The 
lognormal law does not have the scaling properties of the normal distribution, and as a consequence it seems 
harder to construct elegant mimicking processes. 

Example 2.3 (Continuous uniform). Let ~ U[—t,t]. Although has a density, A t consists of a pair of 
point masses. 

(t 2 + x 2 ) 

U(t,x) — I{—t<x<t} T l a 'l-^{|x|>t}> 

P{t,x) —U (t,x) —I{ — t<x<t} j 

Q(t : x ) —U(t,x) ^ ^f — *< X <*1’ 

dx 

lt{dx) = ^ 2 'I{—t<x<t}i 

. l t 1 - 

At “ Yt 6 ~ t + Yt 6t ’ 

R t (x) = jl{- t<x<t } < j := K(t). 

In this case y t = jU[—t,t] and X t = jU{—t,t} so that the signed measure q t does not have a density. 
Nonetheless, for B = (b,b) with 0 < b < b (for example) we find pt{B) = 

bAt—bAt 


an d TtPt{S ) = TtkbKtKb}- 


2 1 


If— = qt{B) for all t except t = b and t = b. Hence pt((b, b)) = /j, e ((b, b)) + J e q s {(b , b))ds. 
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Example 2.4. Suppose /i t ~ e ( <5o + (1 — e *)£/[—1,1]. 


U(t,x) = 

Q{t,x) = ^U(t,x) = 
7t = 
A t{dx) = 

Rt{x) = I{x=0} — 


(1 + X 2 ) 
2 


(1 — e _t ) + e _t |x| 


^{-i<x<i} + \x\I{\ x \>i}', 


e -t 

“^“(1 + x 2 — 2|x|))/{_i <rE <i}; 

e ‘(So; 

6 _ ^ 

-~ ^{—i<x<i}^ a; ; 

1 := K{t). 


Note that in the first three of these examples if T = (e, T) with e > 0 then we may take K = K(e), but 
if e = 0 then there is no global bound on R t (x). 

Example 2.5 (Two point Discrete Uniform). Suppose n t ~ \8 t + |5_ t . Then U(t,x) = max{t, |x|} and 
Q(t,x) = I{~t<x<t}- This indicator function cannot be written as the difference of two convex functions, 
and this example lies out the scope of our analysis. Note, it is not the presence of atoms in this example 
which means that our approach does not work, but rather the fact that the locations of the atoms move over 
time. 


2.2 The construction 

Our goal is to construct a pure-jump Markov martingale with the required marginals. The idea behind the 
process X we construct is that it stays where it is whenever and wherever possible. In Section |4] this will be 
one of the features which leads to X having minimal variation. It also leads to a simple construction: most 
especially we can write down the process directly, and no limiting procedures are used. 

The construction is based upon the twin ingredients of a Poisson point process and a family of martingale 
transports. The Poisson point process determines whether there is a jump, and then conditional on there 
being a jump the martingale transports determine the new location of the process. Our problem is to specify 
the ingredients in such a way that the resulting jump process has the desired marginals. 

First consider the point process. Let (N t )t >o be a Poisson point process on (0,oo) x (0, oo) x (0,1) with 
density ds x dh x du. Let M be the set of locations of events of the Poisson process. We think of the 
three coordinates as time, height and label. If we want to restrict the time domain of N to an interval 
A e.g. A = (e, T\ then we write N A for the Poisson point process (W)teA with density ds x dh x du on 
A x (0, oo) x (0,1). 

Conditional on there being an event of the Poisson Process at ( s,h,u ) for which R S (X S ) < h then the 
process jumps at time s. The location of the jump destination depends on the label u and the martingale 
transport. If the process is at x at time s, if the first two coordinates of N determine that there should be 
a jump, and if 7rf denotes the regular conditional distribution of the post-jump location, then we set the 
post-jump location to be F~x{u) where u is the label and F n x^y) = 7rJ((— oo, y]). By convention (this will 
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not impact on whether X has the required marginals) at the time of the jump X takes the value it jumps 
to rather than the value it jumps from. Hence, the process X we construct is cadlag. 

Now consider the family of martingale transports. Suppose vo and zq are probability measures on I. 
Let AJi(zqi,zq) be the set of martingale couplings or martingale transports from v® to zq. In particular, 
7r E A'li(r'c), zq) is a probability measure on Ixl which satisfies the marginal constraints f n(dx, dy) = v$(dx) 
and f x n(dx, dy) — vi(dy) and the martingale constraint f y (y—x)n(dx, dy) = 0. Then 7 r has an interpretation 
as a rule for the ‘transport’ of mass from uq to zq which respects the martingale property. We have that 
zq) is non-empty if and only if zyj and zq are increasing in convex order which we now assume. 
Any 7r E Ali^OjZq) can be written in disintegration form ir(dx,dy) = n x (dy)uo(dx). Then f y n x (dy) = 1, 
f y yn x (dy) = x and f x TT x (dy)iso(dx) = zq(dy). If zz 0 and zq are orthogonal then 7r x ({x}) = 0. 

The notion of a martingale transport generalises easily to any pair (z>o,zq) of measures with the same 
finite total mass which are in convex order. Let m be the mass of vq. If zq = zq/m and if zf E Afi(zq), zq) 
has disintegration representation jr(dx,dy) = Vo(dx)Tr x (dy) then 7r given by n(dx,dy) = mir(dx,dy) = 
m.i'o(dx)TT x (dy) = Uq( dx)ir x (dy) is a martingale transport of D 0 to zq. Let M = AI(z'o,zq) denote the set of 
martingale transports in this wider sense. 

Several martingale couplings are known and can be described in more-or-less explicit terms. Many of 
the couplings are associated with solutions of a Skorokhod embedding problem for Brownian motion for a 
non-trivial initial law. For a pair of probability measures in increasing convex order, Chacon and Walsh [8] 
give a general family of solutions, of which the construction in Hobson [19] is a special case related to the 
Azema-Yor [3] embedding (the Azema-Yor embedding covers the case of an initial law which is a point 
mass). The Hobson m solution inherits from the Azema-Yor embedding the property of having a very 
explicit construction. Bertoin and Le Jan [3j give a martingale coupling based on local times in the case 
where vq and zq are orthogonal. Hobson and Neuberger [231 gave the coupling which maximises the L 1 norm 
of the increment. Recently, Beiglbock and Juillet [6] introduced the ‘left-curtain’ martingale coupling. For 
our purposes the construction of Hobson and Pedersen P3] has certain desirable properties in that the first 
step of the construction is to fix any mass in common between uq and zq, consistent with the general idea 
of leaving mass unmoved wherever possible. The Hobson-Pedersen construction is one of the most explicit 
martingale coupling for general measures, and we describe it in detail in Section [2751 below. In Section |4] we 
will exploit the coupling of Hobson and Klimmek [22] which like the constructions of [23] and [B] has certain 
optimality properties. 

Suppose we have our Poisson point process and a family of martingale transports (Ttt)teT with 7r t E 
•Ad( 7 t,At). We begin by explaining, first informally and in the regular case, why the process X — (Xt) t£ j 
we construct has density p(t,x). 

Suppose that at time »El the process is at x and x is in the support of and more generally that x is 
in the support of for v < s < T. Then (X t ) v < t stays constant until r = inf{s > v\3(s,h,u) E J\f D (h < 
R s (x))}. Note that since R s (x) < K(s) < K we have that r > v almost surely. Then, from the properties 
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of the Poisson process, for fixed v, t and x 


({A/’fl {v < s < t) fl (h < i? s (®))} = 0) = exp 


R s (x)dt 


= exp 




p(t,x) 
p(v,x )’ 


and it follows that 

P(X t £ dx ) = P(Al„ £ dx) X \ = p(t,x)dx. 

p(v, x) 


Now suppose that x is not in the support of 7 ,,. Consider the rate of change in density at x at time v, 
assuming that X v has density p{v,x). We have 


P (X v+dv £ dx) = P(X.„ £ dx) + [ P(A„ £ dz)dv R v (z)-K*{dx). 

Jzelj 

If 7 r t £ At) then the right-hand-side of (Q]) can be rewritten as 


( 1 ) 


p(v,x)dx + / 

JzeK 


p(v, z)R v (z)dzTTy(dx)dv = p(v,x)dx+ / ^ V {dz)^l{dx)dv 

Jzeil 

= p{v, x)dx + X v (dx)dv 
= p(v,x)dx + p(v,x)dx dv, 


and hence P(X^ +( j„ £ dx) = p{v + dv, x)dx as required. 

Our goal now is to extend this heuristic argument to prove that X t has law p t - For t £ T define the 
operator A J 1 ’* by 

A^hi.x) = R t (x) | J ir* (dy)h(y) - h(x) 

and recall that this is non-zero only if re £ supp( 7 t). Suppose (7r t ) tg T is chosen with 7 r t £ Ad( 71 , At) and let 
X = (Xt)t £t be the Markov process with generator A1* and initial law X e ~ p e . Then X is a pure-jump 
Markov process. 


Assumption 2.3. Suppose the family (7r t ) t6 T is such that 77 £ Ad( 7 *, A*) has representation 7 T t {dx,dy) = 
7 t(dx) 7 rf (dy) and for all measurable sets B, wf(B) is measurable in x and t. 

Our main theorem says that under the assumptions of this section, the process X = has marginals 

{pt ) te f ■ The proof of this result relies on results of Feller _12! and Feinberg et al mi on the existence and 
uniqueness of solutions of the Kolmogorov forward equations for pure-jump Markov processes. Feller [12] 
proves existence under an assumption that jump rates are continuous in t, and this condition is weakened 
to measurability in HU. The idea in these papers is to use the fact that there are only finitely many jumps 
(almost surely, in T) to construct the process and its transition functions as a limit of processes which have 
at most n jumps. These pure-jump proceses with a bounded number of jumps are simple to characterise, and 
it is possible to write down the probability transition functions in terms of integrals involving the transition 
rates. 
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Theorem 2.1. Suppose Assumvtions \2.1\ 12.21 and \2.3\ hold. Suppose T = (e,T) and suppose X e ~ p e . Then 
for t S T, X t ~ p t ■ 


Proof. For tg T, let denote the law of X t . Then u e = p e . Further, since Rt{x) is bounded, I is g-bounded 
in the language of Feinberg et al m, and by Feller [T2} Equation (37)] or Feinberg et al [TTJ Corollary 4.2], 
for measurable sets B 1 


v t (B) = p t {B) + f ds [ v s {dx)R s (x)'K x s (B) - f f R s (x)v s (dx). 
Je J xG/ J e J B 

Conversely, by construction and design the family (pt) te j is such that 
Pt(B) = p e (B) + ( ds q s (B) 


= MB) 
= MB) 
= MB) 



Xs(dy) - ds J 7 s (dx) 

7 s{dx)n x (dy) - J ds J 'Ys(dx) 
p s (dx)R s (x)Trf (B) — j ds J R s (x)p s (dx). 


/ x£lJy£B 


( 2 ) 


(3) 


Let (Mtef family of signed measures given by r/t = vt — pt, and note that rjt(I) = 0. Let At denote 

the total variation distance between v t and pt so that A t = sup B \r]t{B)\, and let A t = sup e<s< t A s . Then, 
comparing © and m 


Vt(B) = f ds f p s (dx)R s (x) f irf(B) - I {xGB } 
Je J x(zl L J xdzl 


and it follows that 


\Vt(B)\ < ds \Mdx)\K{s) < 2 / K{s)A s ds < 2K / A S ds. 

Je J xG/ «/ e J e 


Hence A t < 2 K f* A S ds and since the right-hand-side of this last expression is increasing, A t < 2 K J* A S ds. 
It follows from Gronwall’s Lemma (Ethier and Kurtz [10} Appendix, Theorem 5.1]) that A t = A t = 0 
and X t ~ pt for t € T. Finally, since the rate of jumps is bounded, F(Xt € B) = lim s -|-T P(A S G B) = 
lim s -fT Ps(B) = pr{B) by Assumption 12.2l eh □ 


2.3 An explicit martingale transport 

The goal of this section is to show that even when and At are arbitrary (integrable) measures we can write 
down a martingale coupling. 

Fix t and write 7 as shorthand for 74 , and similarly A as shorthand for At, where the orthogonal measures 
7 and A have the same finite total mass and the pair ( 7 , A) is increasing in convex order. Our aim is to write 
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down a martingale transport n(dx, dy) which has marginals 7 and A and respects the martingale condition. In 
fact we write down n x (dy)-, then n(dx, dy) = r ){dx) , n x {dy). The construction we give is based on the solution 
to the Skorokhod embedding problem with non-trivial initial law given in Hobson and Pedersen [24] . The 
only difference with respect to that paper is that there 7 and A are probability measures, but the extension 
to measures with finite mass does not introduce major complications. 

Let Q{x) = ±{E A '~ A [|A- — x|] — E a ~ 7 [|A' — x|]}. Since 7 and A are in convex order we have Q > 0, and 
Q is continuous in x. Define 


£(*) 


sup 

w:w<z 


Q(z) - Q(w) 
z — w 


and let g(z) < z be the largest of those values where the supremum is attained, or if the supremum is not 
attained then £( 2 ) = Q'(z-) and we set g(z) = z. Now define E(z) = A((— 00 , z)) — £(z). Then (Hobson and 
Pedersen [241 Theorem 2.1]) E is an increasing, left-continuous function. 

Decompose A into a continuous part and a sum of atoms: A(A) = J zeA \ c {dz) + -Ml 0 })- For z > x 

define T x (z) by 


T x (z) = exp 



A c {dw) 

) 7((-°o,w)) - 



A(W) 

7 ((-oo,u)) - E (u) 


+ 


T x can be thought of as a survivor function of a probability measure. 

For x in the support of 7 we define ir x separately onz>i and z < x. For z > x, we set 

f‘00 / nW J \ 

**&,<*>)) = l 7277) 

and for z < x, 


The following result is an immediate consequence of Proposition 5.4 in [21] . 


du 


u- g(u) 


T>,)- 


xu n r r dw ( 

* ((-«>» *]) = J x W)<4 m _ gM ex P ^ 


(4) 

(5) 


Proposition 2.1 (Hobson and Pedersen [24|). With n x as above and n(dx,dy) = r y(dx)ir x (dy) we have 
7 T G At( 7 , A). 


Example 2.6. Suppose 7 ~ jU[—t,t] and A ~ ^S-t + ^S t (as is the case for Example 12.311 . Then 
Qi x ) ® )i{—t<x<t}- 

We find £( 2 ) = jp[t — z) ior —z < t < z (and £(z) = 0 otherwise), and g(z) = — t for — 2 < t < 2 (and 

g(z) = z otherwise). Then S (z) = (t + z) + for z < t and for x G (— t,t), T x (z) = I{ z <t}- Note that for 

z<t, f z dw , , = f z ^ = In |±2 and hence exp(- f z dw , , ) = f±s. Then for -t < x < t and 2 > x we 

— 7 Jx w—g(w) Jx t~\~w t-\-x L v Jx w—g(w) J t-\-z 

have 7r x ([2,oo)) = ^/{ 2 < t }. 

For —t<x<t and z < x we find tt x ((-oo, 2 ]) = I{-t< z } f* = J {-t<z}[-f+^=z = 

The situation can be summarised as n x = S_ t + ^T^t- 
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3 Extension to [0, T] 

In many of our examples the bound K (t) diverges at t = 0 and the methods of the previous section are only 
sufficient to construct a process on [e,T] for e > 0. (A rare exception is Example 12.41 1 In the case where 
K{t) diverges at zero we give two approaches to constructing a process on [0, T\. The first approach is based 
on the Kolmogorov consistency condition and is non-constructive. For related ideas in a similar context, see 
Oleszkiewicz [29] . The second approach requires extra assumptions, and makes use of time-reversal. 


3.1 Consruction via the Daniell-Kolmogorov Consistency Theorem 

Let e be a parameter in (0,T). Let X F) = (X^) e < t <T be the Markov martingale with initial law p, e 
and marginals (/q) e <t<T constructed as in the previous section from the family of martingale transports 
( 7 T t^)e<t<T and the realisation = (A ft^) e <t<T of the Poisson point process (N^) e<t < T . Write X( e ) = 
(xj:^) e <t<T = X( e \Xe e \ (J\f^ )e<t<r) to indicate that given the family of martingale transports the path 
(*| e) ) £<«t is a deterministic function of the initial value (at time e) and the realisation of the Poisson point 
process. 

Suppose now that we are given a family of martingale transports ( ^t)o<t<T and a Poisson point process 
N on (0,T] x (0, oo) x (0,1). For any 8 in (0,T) we can consider the restrictions of (nt)o<t<T and ( N t )o<t<T 
to (5, T], Write 7r (<5) = (n { t d) )s<t<T ■= (n t )s<t<T and = {N^ S) ) s< t<T ■= ( N t )s<t<T for these restrictions. 
Then define X^ = (xj:^)s<t<T = X( S \X$ S \ (Af^)s<t<T) where xj' 5 ' 1 is a random variable with law fj,$. 
It follows that if 8 < e and X^ 1 is taken to equal xi ^ then there is an identification or coupling between 
processes started <rt different times nnd and (X t ^) e<t<T are identical, omega by omega. 

For 0 < t\ < t 2 < • ■ ■ < t n < T define 


A*t i 




x. 


(<9 
*2 ’ 


,xn. 


( 6 ) 


By the above comments, A*t 1 ,t 2 ,...,t n does not depend on e (provided e < ti). Also it is clear that the measures 
{l JI 't 1 ,t2,...,t n )nEN,o<ti<t2<-~<t n <T are consistent, since any pair are the finite dimensional distributions of a 
stochastic process (X^) e < t <T for some e sufficiently small. Hence, by the Kolmogorov consistency theorem 
there exists a stochastic process X = ( X t )o<t<T such that the finite dimensional distributions of X are given 
by the measures in ([6]). 

Let H>k,T = {j2~ k T; 1 < j < 2 k } and let By = Ufe^iB^r so that B^ is the set of dyadic rationals on 
(0,1] rescaled to (0,T]. Consider (X t ) te D r - Fix 0 < to < t± < T with ti G By. Then with e < to, and using 
the right-continuity of X^ e \ 


P(x t — x t0 ,vt g Bt n [to, ti]) 


limP(X t = X to , Vt G Bfc t n [t 0 ,ti]) 

k 

limP(X t (e) = x£yt G b kiT n [to, ti]) 


P(Xj C) = xj*\\/t G [t 0 , ti]) 


E 


exp 



Rtixtf 



> exp(—K(ti 


to)), 
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for some constant K. In particular, on a set f2i of probability 1, X : i—>• R is piecewise constant and has 

finitely many jumps on any closed sub-interval of (0,T]. 

On Oi let X : (0, T] R be defined by X t = lim S 4. tiSe D T X s , and on the null set let X t = JI. Then 
every path of X is piecewise constant and right-continuous on (0,T] and has finitely many jumps on any 
closed sub-interval of (0,T], 

Suppose either that I\(t) is bounded on (0,T) or that fit converges in L 1 to a point mass, necessarily at 
Ji. In the former case let Ao = lim s ^o,seD T X s (which exists since X has only finitely many jumps in (0, T] 
if K is bounded) and in the latter case, let A'o = JI. 

It remains to check that X has the desired marginals, but for t > 0 this is immediate given Assump¬ 
tion I2.21 e): 

P(X t £ B) = lim P(X S £ B) = lira uJB) = MB). 

s4.t,seB T s4.t,seB T 

When t = 0, if K(t) is bounded on (0,T) then the same argument applies, otherwise A 0 ~ fi 0 ~ <5jr by 
definition. 

The fact that A' is a martingale (relative to its own natural filtration) follows as in Oleszkiewicz [29] . 

3.2 Extension to [0, T] via time reversal 

Under some extra conditions it is possible to sketch an alternative and more direct construction of a process 
on [0, T], In particular in this section, in addition to the hypotheses of Assumptions 12.ll 12.21 and 12.31 we 
assume that we the family of marginals (fit)tz[o,T] is such that lim t j,o U(t,x) = \x — Jl\ (and then yo ~ 6-p) 
and further that the Radon-Nykodym derivative A t(dy) / fi t (dy) is bounded in y by a function K(t) which is 
bounded on every interval of the form [e,T] with e > 0. 

Throughout this section we assume T is fixed. 

Temporarily fix e. Using the ideas of the previous section we can construct (X^) e < t <T such that 
A^ ~ fi t . Now let A'( e ) = (Ai £ ))o< s <T-e be given by A^ = X^}_ g . Then X^ is the time-reversal of X^ 
and A| e) = X^_ s ~ fir-s fis Indeed, the process X is a pure-jump Markov process with state-dependent 
jump intensities R s {y ) and conditional jump measures iff where 

5 t ^ _ A T-s(dy) ~v/ i \ _ n T-s(,dx, dy) _ 7 T - s (dx)M_ s (dy) 

s[y) - fi T -s(dy) *sM)- XT _ s{dy) - A T - S (dy) 

In particular, X^ does not depend on e in the sense that if Xq 1 = Xq 6 ' 1 then X^ and X^ are pathwise 
identical on the set [0, (T — S) A (T — e)] on which they are both defined. 

Hence we may drop the superscript. Let X be the pure-jump Markov process with initial law A 0 ~ fi$ = 
Ht defined using R s and iff as in the Section [7721 except that here we take X to be left-continuous. (Note 
that iff does not respect the martingale property, so that the process X will not be a martingale.) Then 
X is defined on [0,T), and X s ~ fi s = fir-s- Now set Ao ~ 6-p and X t = Xr-t- Then A = (X t )o<t<r is 
well-defined on [0,T] and Xt ~ fit- Further X agrees with X^ on [e,T] and so A' inherits the martingale 
property from X' € K 
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Example 3.1. Suppose p t ~ e ‘do + (1 — e *)£/[—1,1]. Then 7 t ~ e *5* and A t ~ e *C/[—1,1]. We find 
that for y e (-1,1) \ {0}, R s (y) = 1 l e ^ T l ) a) and = <5 0 - 

Then the time reversed process is well-defined on [0, T], The process starts at 0 with probability e~ T and 
if it starts at 0 then it remains constant at zero throughout. Otherwise it starts at a non-zero point chosen 
according to the uniform distribution on [— 1 , 0 ) U ( 0 , 1 ], and stays constant until the time of the first event 

_ (T —s) 

of a time-inhomogeneous Poisson process with rate , at which point it jumps to zero. Zero acts as 

an absorbing state. 

In forward time the process starts at 0 and jumps to a uniformly distributed location on [—1,1] \ {0} at 
rate 1 . 

Example 3.2. Suppose that for t € [0,T], fi t ~ U[—t,t\. In this case A r-t is not absolutely equivalent with 
respect to y-T-t, and all calculations for this example should be viewed as formal calculations. (Nonetheless, 
this example is sufficiently concrete that it is easy to verify that the proposed processes have all the right 
properties.) We find R s {y) = 0 except at y = ±(T — s), and at those points it is infinite. The interpretation 
is that X only jumps when X s = T — s. Conditional on their being a jump we find nJ~ s (dx) = dx 

and 7 T f^ T ~ s \dx) = ^ x ■ 

The process X can be described as follows. The initial value Xo is uniform on [—T,T], The process 
remains constant until t\ = T — |xo|. At time t± the process jumps to a new point x\ in the interval 
(—(T — n),T — n) chosen according to the density f(x\) = ~ The P rocess can then be 

constructed inductively. Suppose that after k jumps and at time s the process is at Xk where necessarily 
Xk G [—(T —s), T—s]. The process remains constant until Tfc+i =T—\xk\ at which point the process jumps 
to a new point in (—|xfc|, |xfc|) = (—(T — Tk+i), T - Tk+i) chosen with density ^ T ~\ x ^ x ^^9^ x k)) _ 

4 Minimising the expected total variation 

In this section we want to explain how the ideas of Section [2] can be used to construct an extremal process 
consistent with a given set of marginals under an additional assumption on the relationship between the 
measures 7 t and A t . In particular, the process we construct will have smallest expected total variation 
amongst all processes consistent with a given set of marginals. As an application we construct the fake 
Brownian motion with smallest expected total variation. 

Our analysis is motivated by the martingale coupling constructed in Hobson and Klimmek [22] . A key 
assumption in the construction of that paper is that mass is moved using a martingale transport from an 
interval in the centre to the tails. We make a similar assumption, which is augmented with some further 
regularity conditions. A similar condition to Assumption Id. If al is stated in Henri-Labordere and Touzi [IT] 
as the difference in the cumulative distribution functions (of A t and 7 1 ) having a unique local maximum. 

Assumption 4.1. (a) (Dispersion assumption) There exists a family of open intervals (E t )te t such that 

supp^jt) C E t and supp(Xt) C Ef. Moreover, the intervals (E t )te t are increasing: for s < t we have 
E s Q Et; 
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(b) (Regularity) U(t,x) £ C 1,2 and Qt(x), Q' t (x) and r^a;) are continuous in t for all x. 

(c) (strict positivity of densities on their domains) For each t £ T p(t,x) < 0 for x £ E t (and then 
E t = (£^ft),r^(t))) and p(t,x) > 0 for x £ (£\(t),£^(t)) U {r^ft),r\(t)); 

(d) (Finite activity) /(|a;| + l)\p(t,x)\dx < oo for each t £ T and J teT dt /(|x| + l)\p(t,x)\dx < oo. 

Consider the Brownian motion fExample l2.ll) and exponential Brownian motion fExample l2.2D examples. 
For the former Eft) = (—y/t, y/t) and I t = K. For the latter, Eft) = (exp(— \Jt + t 2 / 4), exp( \Jt + t 2 /4)) and 
It = (0, oo). In both cases all the elements of Assumption 14. II are satisfied. However, in the uniform example 
f Example 12.31) and in Example 12.41 we find U(t,x) C 1 ’ 2 , and these examples do not satisfy the regularity 
conditions (b) and (c) of Assumption 14.11 Nonetheless, the dispersion assumption (Assumption 14. H all is 
satisfied in Example 12.31 mass is moving from a central region to the tails and there exists an open interval 
E t = (I 1 ft),r 1 (f)) = {—t,t) such that p(t,x) < 0 on E t , and supp(A t ) C Ef. In Example 12. 41 the spirit of 
Assumption 14.11 a! is satisfied in the sense that mass is moving from a central region to the tails, but in this 
case E t = {0} is not open. For these examples it is possible to prove optimality directly, see the discussion 
in Example 14.21 below. 


4.1 Construction of the martingale transport for fixed t 


In this section we will consider t to be fixed and generally write t as a subscript. Later we will consider t as 
a parameter. 

Our goal is to show how to write down a martingale transport Tr t £ Mfyt,\t) under Assumption 14.11 
In particular, the idea is that 7r t has a decomposition 7r t {dx,dy) = "ft{dx)Tr((dy) where 7rf has a binomial 
distribution: 

_ b t {x)~x x x-a t (x) x 

t b t fx) — a t fx) at ^ btfx) - a t (x) bt(a ^ 


Thus, for this candidate martingale transport, mass at x moves down to at(x) or up to b t (x), where at{x) < 
x < bt(x), and the probabilities of up and down moves are such that the martingale property is satisfied. Our 
goal is to explain how to choose atfx) and b t (x) such that 7r t £ M.fy t ,Xt)- A key element of the construction 
will be that atfx) and btfx) are decreasing functions of x. 

Following Hobson and Klimmek [22] for x £ E t and y > x define Qt, x (y) = Qt{x) + Q' t {x)(y — x) — Qtfy)- 
For x £ E t define 

x t \ f Qt,x{0) ~ Qt(ot) 'I 

<ptfx) = max Q<x</ 31- _ — -| (8) 


and let the maximum be attained at the pair (a = atfx),fl = b t fx)). Under Assumption 14. II land especially 
the dispersion and strict positivity elements) Q t is strictly convex and increasing on {£\(t),£ 7 ft)), strictly 
concave on (£j(t),£'yft)) and strictly convex and decreasing on (r 7 (f),r.\(i))- This guarantees the existence 
and uniqueness of optimisers a t fx) and b t fx) in the right-hand-side of ([5J with a t (x) < i-yft) and b t fx) > r 1 (t). 
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Proposition 4.1. a*(-) is continuously differentiable in x on E t and satisfies 


, , . . . ... bt(x) — x 

a t [x) p(t,a t (x)) = - - --p(t,x). 

b t (x) - a t {x) 

Similarly b t (-) is continuously differentiable on E t and satisfies 

b' t (x)p(t,b t (x )) = - X at ^ p(t,x). 

b t \x) - a t {x) 

Proof. We prove the result for a t , the proof for b t being similar. 

The construction of a t , b t given in Hobson and Klimmek [22] is a martingale transport of probability 
measures, but generalises easily to pairs of measures with the same total mass which are increasing in convex 
order. By construction at(x ) = (Q[) _1 ° 4>t ° 7t((—oo, x]). By the continuity and strict positivity of p{t,x) 
on (£\(t),£y(t)), (Qt) _1 is continuously differentiable, as is 7 t ((—oo,a;]). Further, by Lemma 5.1 and the 
argument after Lemma 5.2 of Hobson and Klimmek [22], 4>t is continuously differentiable. Hence at(-) is 
continuously differentiable. The expression for the derivative is taken from Section 3.1 of [25]. Note that 
[22] covers the case of general probability measures (satisfying Assumption 14. H all including measures with 
atoms. □ 


Corollary 4.1. Define nt by nt(dx,dy) = r yt(dx)wf (dy) where nf is as given in 0. Then nt € A4(7t,A t) 
and irf (B) is measurable in x for all measurable sets B C I. 

Proof. Suppose H is a bounded, continuous test function. Then, using Proposition 14.11 in the second line, 
and a change of variables in the third, 


[ [ Tt{dx)rf{dy)H(y) 
J X J V 


= [ p{t,x) dx ( /M 3 ) — j-jH(a t (x))+ ^ - Qt ^ H{b t (x)) 

JE t \bt{x) - a t \x) b t {x) — a t (x) 

= - [ H(at(x))a' t (x)p(t, a t {x))dx - f H(b t (x))b' t (x)p(t,b t (x))dx 
J E t J Et 

= [ H(y)p{t,y)dy+ [ H{y)p(t,y)dy 

A t (dy)H(y) 


and it follows that n t £ Measurability in x of nf(B) is clear from the monotonicity of a t and 

b t . □ 


Now we introduce some auxiliary variables which will play a role in the evaluation of the minimal expected 
total variation. Fix Xq £ n te T^t- For x £ E t = [fy(t),r 7 (t)\ define 


0 t {x) 



2 

b t {z) - a t (z ) 


dz 


i>t{x) 



xo) ^ b t (z) - a t (z) 
b t (z)-a t (z) ~ 


17 








Extend these definitions to x £ E t by setting 


9t(x) 


6 t (a t 1 {x)) x < £y(i) 1 
Ot{K\x)) x > r 1 {t) J 


Ma t X (a;)) + K 1 {x) - a:)(l - 9 t {a t 1 (x))) x < £ 7 (f) ) 
MK 1 ( x )) + ( x -K 1 ( x ))( 1 + d t{bi 1 (x))) x > r 1 [t) J 


Set L t (x,y) = L^, ti g t (x,y) = \y-x\ + if t (x) + 9 t (x){y - x) - 


Proposition 4.2 (Hobson and Klimmek 122). L t (x,y) > 0, with equality for y G {a t (x),x,b t {x)}. Further, 
ipt{ x ) < |a; — a;o| and ift is convex on E t and concave on (£\(t),£j(t)) and (r-y(t),r\(t)). 


Proof. The only part of this result which is not contained in Theorem 4.5 of Hobson and Klimmek [22] is 
the fact that ift(x) < \x — xq\. But 


0 < L t {x 0 ,x) = \x- x 0 \ +ip t {x 0 ) + 9 t {x 0 )(x - x 0 ) - tpt(x) = \x - x 0 | - il>t(x). 


□ 


Corollary 4.2. For any random variables X,Y G L 1 with E[(Y — A')|X] = 0 we have ¥.[ipt(Y) — ip t (X)\ < 

n\y-x \]. 


4.2 Continuity in t 


Now we consider the dependence of the various quantities on t. Where appropriate we switch to writing t 
as an argument rather than as a subscript; for example we now write a(t,x) in place of at(x). 


Lemma 4.1. aft, x) and b(t,x) are continuous in t. Further, ip(t,x) is continuous in t. 


Proof. We have the representation a(t,x) = (Q't) -1 o <j) t o 7 t ((—oo, a;]). By assumption, Q' t and T( are 
continuous in t. Since <j>(t, x) is continuous in x, continuity of a will follow if cf>(t, x) is continuous in t. 

Recall (IS]) and the fact that x) is the slope from (aft, x), Qft, a(t, x))) to (bft, x), Qt,x(b(t, x))). Then, 
using continuity of Q t and Q' t in t (Assumption 14. li b’) 1 


xt4) ^ Qt',x{P)-Q(t',a) Q t ' iX (b(t,x))-Q(tf,a{t,x)) t'->f Qt,x(b(t,x))-Q(t,a(t,x)) 

(pft , x) = sup - - - > - —— --- > - —— --- =<t>{t,x). 

a<x <(3 p — a bft,x) — aft,x) b(t,x) - a(t,x) 


Conversely, given 77 > 0, for t' sufficiently close to t we have Q(t,y) > Q(t' ,y) — 77 and Qt, x fy) < Qt,x(y) 
77(1 + (y — x)), and then if r 1 (t') — i^ft') > r 7 (f—) — f 7 (t—) — 77, 

277 

- sup --- 1-7/-+-— - cpyi,x)-\-'H 1-7—— 

: — a 


xu , ^ Qt'AP) - Qft’,a) Qt, x ((3) - Q(t, a) 2t? 

cpft ,x) — sup - 1 --- < sup — 1 --- I-77+- - < (p{t,x)+r]+ 


/3 — a 


/3 — a 


ot<x<(3 P ™ ot<.x<f3 

Hence for fixed x, <f>(t, x) is continuous in t. 

Continuity of b(t,x) in t is similar, from which continuity of ip(t,x) follows. 


r 7 (f-) - £j(t~) - 77 

□ 


Corollary 4.3. Suppose Assumvtions \2.1\ 1 2. 2\ and \4-l\ hold. Then X t ~ pt. for t G T. 

Proof. It is clear that under Assumption 14.11 Rt (x) and 7rf are such that the measurability requirements of 
Assumption 12. 2l df and Assumption 12.31 are satisfied and hence by Theorem 12. II that X t ~ pt- □ 
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4.3 Minimising expected total variation: the primal approach 

The idea now is to show that, under Assumption 14.11 the process we have constructed has the smallest 
possible expected total-variation in the class of martingales with the specified marginals. 

Suppose e > 0 and let T = [e,T], Let P T = {to, ti, • • • tv} be a partition of T of dimension N, i.e. a vector 
of length N + 1 with t 0 = e, tk > tfc-i and tjy = T. We say V T = (Pj) n > i is a dense sequence of nested par¬ 
titions of [e, T ], if Pi = {tft, t?,... t n N(n) } is a partition for each n, Pj C {tg +1 , t” +1 ,... t" + \... i^ +1) } = 
Pj +1 andJim ntoo max 1 < fe < Ar(n ) |t£ - t^J = 0. 

Let Pj}„ = {t^’ n , tf’ n ,... t^."} be the uniform partition of [e, T] of dimension 2” in which t]7' n = e + (T — 
e)k2~ n , and let Vjj = (Py„) n >l- Then Vjj is a dense sequence of nested partitions. 

Definition 4.1. Let f : [e, T] > R be any function. The total variation of f along a partition P T is 

_ N 

V(P T ,/) = ^|/(t fe )-/(t fc -i)|. 

*:=i 

P/ie total variation of f along a dense sequence of nested partitions P T is 

N(n) 

V(P ¥ ,/) = lim^ |/(tD-/(tfc-r)l- 

fc=l 

P/ie /act t/iat t/ie sequence is nested means that the limit exists. 

The total variation of f is V T (/) = supV(P T ,/) where the supremum is taken over dense sequences of 
nested partitions. Note that V T (/) > V{Vjj,f). 

Definition 4.2. Let Z = (Z t ) te j be a stochastic process. Define the expected total variation V T (Z) of Z by 
V T (Z) = E[V T (Y)] = E[sup V(P T , Z)\, where the supremum is taken over partitions which may depend upon 
the realisation of Z. 

For the sequence of uniform partitions Vjj define the expected total variation along the uniform partition 
V(fPjj,Z) by V(fPjj,Z) = E[lim n V(Pj} n , Z)\. By the final remark in the previous definition, V T (Z) > 
V{Vl,Z). 

Suppose e > 0 and T = (e, T). 


Theorem 4.1. Suvvose Assumvtions UpR 12. £1 and \4-l\ hold. LetY be any martingale with marginals 
Then, 


V t {Y) > V{fPjj\Y) > dt dxip t {x)p(t,x) = dt ip t (x)q t (dx). 


(9) 


Proof. We only need to prove the second inequality. Let [f] n = maxjtfc £ P^ ; tk < t}. Then [t] n f t and by 
Corollary 14. 2 1 for s < t we have, E[|Y t — Yg|] > E[^(s, Y t ) — ip{s, Y s )], 

- 2 n 


V(K, n ,Y) = 


= E 


£ ip- - P- 

lk=l 
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> 


k =1 

E /^^(4-”^) p(tk ,n , x ) -p(tk-v x ) 


k=l" 1 

= J dx J ip([t\ n ,x)dtp(t,x)dt 
—* J dx J dtrp(t,x)p(t,x)dt 

where we use the fact that ij)([t\ n ,x) —> ip(t,x), ip(t,x) < \x — xo|, f T f (\x\ + l)\p(t,x)\dx < oo and dominated 
convergence. 

□ 


It remains to show that this bound is best possible, or equivalently for the process constructed in this 
section there is equality in ©. This follows from the fact that L t (x, y) = 0 for y £ {a(t , x), x, b(t, a;)}. 

Theorem 4.2. Let X be the process constructed in Section^ using the family of martingale transports given 
in ([?p. Then 

T t (X) = J dt J dxif(t,x)p(t,x). (10) 

Hence in the class of martingales with marginals X minimises the expected total variation. 

Proof. Firstly note that from easy properties about zero mean random variables taking only two values 


E [ \X t +dt — X t \\X t = x\ = Rate of jumps x dt x E [|AX t || Jump at t, X t = x] 

2 (b(t,x)-x)(x-a(t,x)) 

= H t (x) - —— --- dt 

b(t, x) — aft, x) 

Recall that by construction X is right-continuous, and almost surely has only finitely many jumps in 
T = [e, T], We may assume that the jump times are elements of the partition and then 


V T (X) =E 


E 


e<t<T 


,T it [ ( ,(t,x)fi,(x) 2(t(t d ) ~. x)(x ~“ (t - x)) Jx 


[ dt f 2 p(t,x) 
J e J E+ 


bft , a;) — aft, x) 
_ (bft, x) — x)(x — aft, x)) 


The result will follow if we can show that 


bft, x) — aft, x) 
_ (bft, x) — x)(x — aft, x)) 

dx. 


dx. 


We have 


dxip t (x)p(t,x) = 2 / p(t, x, 

J JE t bft, x) — aft, x) 

dx , ijjt(z)f>(t,x)dx = — / x)p(t, x)~dx + / i/j(t,y)p(t,y)dx. 
JE t je% 


(11) 

( 12 ) 
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Then, from the fact that L t (x,y) = 0 for y £ {at(x),x,bt{x)}, for y = a(t,x ) we have ipt(y) = V’( t,y ) = 
ip{t,x) + (x — y)( 1 — 0{t,x)) and for y = b(t,x) we have 4>t{y) = = ip(t,x) + (x — y )(—1 — 6(t,x)). 

Then, by Proposition 14.11 




Similarly, 


'V>rj(t) 


dyil>(t, y)p(t, y) = - dxa'(t,x)if(t,a(t,x))p(t,a(t,x)) 

JE t 


[ dx x) + (x - aft, x)) — (x — a(t, x))0(t, x)] J x) 

J Et b{t, x) — aft, x) 


x ) — <z(t, x) 
x — a(f, x) 


r x — #) 

dyi/j(t, y)p(t, y)= dx [V>(£, x) - (x - 6 (£, x)) - (x - b(t, x))0(t, x)] . - 7 -—rp(£, x) 

J Et b(t, x) — a(t, x) 


Substituting these last two expressions into (1121) we find that the terms involving ip(t,x) and d(£,x) cancel 
and (fill) follows. □ 


Remark 4.1. Let H t = J ij)(t,x)p(t,x)dx. Suppose that that f teT dt f x dx\ip(t, x)\p(t, x) < 00 and note that 
Assumption ll.lf dl implies f teT dt f x dx\ip(t,x)\\p(t,x)\ < 00 . Then 


H T = H f 


H f dt = IL 


dt 


ip(t, x)p{t , x)dx + 



ip(t, x)p(t, x)dx, 


and hence if Y = is a martingale with marginals then 


y T (y) > / if>T(x)p(T,x)dx■ — / ipe( x )p( e -i x)dx — / dt ijj(t,x)p(t,x)dx. 


(13) 


Now consider the case where e = 0 and T = (0, T). Results in this case follow on taking the limit e 4- 0. 


Corollary 4.4. Suppose / Q T (|x| V 1 )£t{dx) < 00 . Let Y = (i^)te[o,T] be any martingale with marginals 
(pt)te\o,T]! an d let X be the martingale constructed using the martingale transports in (7|). Then R[° ,t 1(F) > 
y[°> T ](X) = Jq dt J dxip(t,x)p(t,x) and in the class of martingales with marginals (pt)te[o,T]> X has minimal 
expected total variation. 


Suppose further that f ip e (x)p(e,x)dx A 0 and f Q dt f x dx\ijj(t,x)\p(t,x) < 00 . Then 

y[°. T ](y) > y[°’ T ]^Y) = J dt J ijj(t, x)qt(dx) = J ipT{x)pT(dx) — J dt J ip(t, x)p, t (dx). (14) 


4.4 The dual problem and a pathwise representation 

The inequalities © and <H3D are lower bounds on expected total variation. It is well known from the 
literature on model-free option pricing (see, for example, Hobson [TjJ, 2D] and Beiglbock and Nutz H) 
that such bounds are often related to pathwise martingale inequalities. Again the key to this result is the 
inequality L t (x,y) > 0. 

Note that in the following theorem we do not make any assumption that Z is a martingale, or that Z 
has the correct marginals. 
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Theorem 4.3. Suppose T = (e,T) with e > 0. Suppose ip{t,x) and 6(t,x) are continuous and bounded 
on T x I and suppose L t (x,y) = L^, ti g t (x,y) > 0 for all (x,y) £ I x I and all t £ T. Then for all paths 
Z{uj) = {Z u (uj)) uef 


V t (Z(uj)) > ip(T, Z T (w)) - ip(e, Z e (uj)) - / ip(u, Z u (u))du - / 6(u, Z u {u))dZ u (u). 


(15) 


Proof. Note that if the path is not of finite variation then the inequality is trivially satisfied. So fix ui and 
suppose Z = (Zt(id)) te j is of finite variation. We have L^ tt g t (Z t , Z t +h ) > 0 and hence 


'Pit, ^t+h) ~~ Z t ) — dpt, Z t )(Z t+h — Z t ) < | Z t+ h — Z t \. 


(16) 


Consider the uniform partition P^j n = {t ^’ n -0 < k < 2 n j where t^’ n = e + k2 n (T — e) and abbreviate t%’ n 
to t r f. Then, applying (THU) repeatedly, 

2 n 2 n 2" 

k =1 


k =1 


k =1 


k= 1 


The second sum on the top line telescopes. By monotonicity, as we take finer and finer uniform partitions, 
the term in the second line converges: Y^k=i I— Z t n_J —> V(^J, Z) < V T (Z). 

For the remaining terms, recalling [t\ n = max{t^ G Pj : tj < t}, and setting [£] n = min{tj? G Pj : > £} 

[ T o([u} n ,z lu]n )dz u ^ [ T e{u,z u )dz u 

fe=i 

by bounded convergence, since 0 is continuous and bounded and ^ is of finite variation, and 

E (i’itk’Zt”)-i>{tk-i,Zt»)) = j ip(u, Z [u]n )du ^ f ip(u, Z u )du 

k =l 


again by bounded convergence, using the continuity and boundedness of ip. 


□ 


Corollary 4.5. Suppose p £ C\o<t<rEt and po = Sjr- Ifip(t,x) and 9{t,x) are continuous and bounded on 
(0 ,T) x I then if Zq(uj) = ~p 

V [0 ' T1 (Z( W ))>^(T,Z t (u;))- [ ip(u, Z u {u)))du — f 0(u,Z u (w))dZ u (u). (17) 

Jo Jo 
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4.5 Self-similar marginals 

Suppose that in addition to being increasing in convex order, the family (pt)te[o,T] is self-similar in the 
sense that there exists a centred continuous random variable Z and some positive exponent a such that p t = 
C(t a Z). Examples include the Gaussian ('Examplc l2.ll) and continuous uniform ('Examplc l2.3l) specifications. 
Note that we must have po ~ 5 q. 

Let nz be the law of Z , with potential U z , and let Iz with endpoints {£ z ,fz} be the smallest interval 
such that the support of pz is contained in Iz- Let pz be the associated density and suppose that the 
product zpz{z) is continuously differentiable on Iz and that pz{z) + zp' z (z) is strictly positive on an open 
interval Ez = {Ie^e) with Ie < 0 < te and negative on (£z,£e) U ( te,tz )■ Suppose further that (1 + 
\z\ 2 )(p z (z) + \p' z (z)\)dz < oo. Finally, define ( z (z) = -a{p z {z) + zp! z {z)) and suppose sup lE<z<rE < 

K z for some finite constant K z . 

If p(t, y) and U(t, y ) are the density and potential of p t then we have p{t , y) = t~ a p z (yt~ a ) and U(t, y) = 
t a Uz{yt~ a ). Further, if we define Pz(y ) = \o-{U z {y) — yU z (y)}, then 

< 90 , y) ■= \;U(t, y) = t°‘- 1 P z {yt~ a ) = ^at°‘~ 1 {U z {yt~ a ) - yt~ a U' z {yr a )} 
is positive since U z is the potential of a centred random variable, and then, since P z = t;z, 

p{t, y ) = Q"{t,y) = r^ +1 \z(yt- a ). 


It follows that we can define Jt(dy) = p(t,y) dy and A t(dy) = p{t,y) + dy , and that m(t) = f p(t,y) dy 
7 Si z C {z)~dz and n(t) = t a_1 Jj z( z {z)~dz are both finite and well defined. 

By definition R t (y) = |J = < \Kz- 

For y G Ez let A(y) = Az{y) and B(y) = Bz(y ) with A(y) < y < B(y) attain the supremum in 


a<y<b 


f p z{y) + (b- y)P' z {y) - P z (b) - P z (a) 
1 b — a 


Then, as in Section [Til A : E z >->• (£z,£e) and B : E z i-»- (rg,rz) are decreasing. 
For y € E z define 


0 ( 2 /) 


r _H_ 

Jo B(z) — A(z) ’ 



2 y-B(z) - A{z) 
B(z)-A(z) 


Extend these definitions to y ^ E z by setting 


0 ( 2 /) 


^( 2 /) 


0 (A- 1 ( 2 /)); y<l E \ 

0 (S" 1 ( 2 /)); y>r E J 

'^(A~ 1 (y)) + (A~ x (y) — y)(l — Q(A~ 1 (y))); y < t E 1 
4'(B- 1 (2/)) + ( 2 /-B _1 ( 2 /))(l + 0 (B- 1 ( 2 /))); y > r E J 


( 18 ) 
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Finally, define a t {y) = t a A(yt “), b t (y) = t a B(yt “), 6 t {y) = @(yt a ) and ^t(y ) = t a ^(yt a ). Then, for 

y S E t = (t a e E ,t a rE ), 



and ipt(y) < ^t(x) + (2/ - x)6 t (x) + | y- x\. 

Then, for any process Y with marginals (pt)te[e,T ], 



with equality for the process X constructed in Section 12.21 using the martingale transports defined via 0. 
Letting e 4- 0 we obtain V^°' T ^{Y) > J dz^(z)(z(z)- 

Remark 4.2. Note that ip(t,a ;) = at° L ~ 1 ['i!(xt~ a ) — xt~ ai ^'(xt~ a )} < at a ~ 1 J where J = sup y |T( 2 /) — 
y^'(y)\. Then from the shape of 'F (concave, then convex, then concave again) we find J = max{'F(££) + 
(0(£ E ) + l)|£ £ ;|;4'(r B ) + (l-0(r_ E ))r B ;4' , (£ E )£ B -T(£ £ ;);4' , (r E )r E -4 , (r E )} < oo. Similarly sup x \6 t (x) \ = 
sup^ |0(x)| < 2. 

Hence on each interval of the form [e,T\, with e > 0 (fl5l) holds (and if e = 0 then (flTl) holds) and we 
have a pathwise inequality to complement the inequality in expectation. 

Example 4.1 (Brownian motion). For Brownian motion there do not exist explicit forms for 0 or 'F. 
Nonetheless we find V^ 0 ’ T ](y^ > CypT where C is a finite constant which can evaluated numerically: 



The inequality is tight; in particular, there exists a fake Brownian motion with finite total variation on [0, T\. 
Moreover C is bounded above by 



Here we use symmetry about zero, the bounds 0 < \F (y) < \y\, and the fact that (-ypz(y))' = {y 2 — 
l)e~ v / 2 /\/27r is negative on \y\ < 1. 


Example 4.2. Consider the continuous uniform example (Example 003) in which ~ U[—t,t\. In this case 
we can write down values for 0 and \F directly. Note that in (1191) below we have exploited the fact that 
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when x ^ [—1,1] the inequality L t (x,y) = 0 is only tight at y = x to give a simpler form for 0 than that 
given in m- 

Let ^(x) = x 2 A 1 and 0(x) = x/{_ 1<x<1 }. Then, for all 1 , 1 /el, 


^(y) < ^(x) + Q(x)(y - x) + \y - x\, 


(19) 


or equivalently, \y — x\ > 'L(y) — 'L(x) — Q(x)(y — x ). The inequality is easily proved by considering the 
different cases: for example, if— 1 < x < y < 1, 

®(x) + 0(x)(y - x) + \y - x\ = x 2 + x{y - x) + (y - x) = y 2 + (y - x)(l - y) > y 2 = y) 

and if—l<x<l<y, 

'L(x) + 0(x)(y — x) + \y — x\ = x 2 + x(y — x) + {y — x) = 1 + (x + 1 )(y — 1) > 1 = ’L(y). 

Now define ip(t,x) = t^(x/t) and 9(t,x) = ®{x/t). It follows that 


i’(t,y) = t'if(y/t) < t jyx/t) + e(x/t) + 

= Ip(t,x) + 9(t 1 x)(y - x) + \y - x\. 


y _ x 
t t 


Let Y = ( Yt)o<t<T be any martingale with uniform marginals Y t ~ U[—t,t\. Then, disregarding for the 
moment that the regularity elements of Assumption 14.11 are not satisfied, from (j9j) we have 


y[°> T l(F) >lim [ dt f —q t (d: 
£ 1° Je J t 


x ) = I dt 

Jo 


2 1 2 1 

or alternatively, using the second representation in (1141) 


1 1 f l x 2 dx 


1 

1 - - 
3 


2 T 

dt = T’ 


( 20 ) 


i r T 

V^ t \Y)>— x 2 dx+ dt 

J —T 1 J 0 J—T 


rT ~ 2 dx 2 T 


T 2 2 T 


To give a direct proof of this bound we argue as follows: omitting the superscript J7, n on t 


U,n 
k > 


V [0 ’ T] (Y) > V(P ^ ] , Y) > E 


k =1 
2 n 


Y 2 Y 2 

tk Atn- —Atfe-i 


tk -1 

f tk dy y 2 
—tk ^tk ^k— 1 
ftk 


tk— 1 


A tk -1 — 


rtk-i 


'tk 


dy 
_i tk 


rtk-i 


tk -1 + 


dy 


1 


E 

k =1 

2 n 

E 

fc=i 
2 71 

y 

tfc tfctfc-i 3 

AC—1 


y 2 

i tfc—l 


1 


tfc tk—l J tk—1 


y 


tk—l(tk tk— l) (tfc tk—l) tk —1 


fc=l 


2T 

T' 
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